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ABSTRACT 

Background Familial Hypercholesterolemia (FH) is an 
autosomal dominant disease, caused by mutations in 
LDLR, APOB or PCSK9, which results in high levels of 
LDL-cholesterol (LDL-C) leading to early coronary heart 
disease. An autosomal recessive form of FH is also 
known, due to homozygous mutations in LDLRAP1. This 
study assessed the utility of an exome capture method 
and deep sequencing in FH diagnosis. 
Methods Exomes of 48 definite FH patients, with no 
mutation detected by current methods, were captured by 
Agilent Human All Exon 50Mb assay and sequenced on 
the lllumina HiSeq 2000 platform. Variants were called by 
GATK and SAMtools. 

Results The mean coverage of FH genes varied 
considerably (PCSK9=23x, LDLHAP1 =36x, LDLR=b§x 
mdAPOB=$3x). Exome sequencing detected 17 LDLR 
mutations, including three copy number variants, two 
APOB mutations, missed by the standard techniques, 
two LDLR novel variants likely to be FH-causing, and five 
APOB variants of uncertain effect. Two variants called in 
PCSK9 were not confirmed by Sanger sequencing. One 
heterozygous mutation was found in LDLRAP1. 
Conclusions High-throughput DNA sequencing 
demonstrated its efficiency in well-covered DNA regions, 
in particular LDLR. This highly automated technology is 
proving to be effective for heterogeneous diseases and 
may soon replace laborious conventional methods. 
However, the poor coverage of gene promoters and 
repetitive, or GC-rich sequences, remains problematic, 
and validation of all identified variants is currently 
required. 



INTRODUCTION 

Familial Hypercholesterolaemia (FH) is a common 
autosomal dominant genetic disease caused by 
mutations affecting the plasma clearance of 
LDL-cholesterol (LDL-C). 1 FH patients have ele- 
vated levels of total and low-density lipoprotein 
(LDL) from birth 7 and if untreated 7 develop coron- 
ary heart disease (CHD) by the age of 55 in 50% of 
men and 30% of women. 2 In addition to the 
increased LDL-C 7 a proportion of FH patients is 
characterised by the occurrence of tendon xantho- 
mas (TX) 7 and the UK Simon Broome criteria 3 clas- 
sifies TX-positive patients as Definite FH (DFH) 7 
and TX-negative patients as Possible FH (PFH) 7 
with the DFH group having a three times higher 
risk of developing CHD when compared with the 
PFH subjects 4 5 Statin therapy has been proven 
highly effective in the treatment of FH patients 7 
and the importance of an early identification of FH 



patients for the statin treatment has been demon- 
strated 6 and recently highlighted. 7 

The clinical phenotype of FH is known to be 
due to mutations in three genes encoding proteins 
involved in the clearance of LDL-C from the 
plasma 7 LDLR, APOB and PCSK9- There are over 
1200 different LDLR mutations 7 8 but only one 
common APOB (c.10580G>A 7 p.R3527Q) and one 
PCSK9 (c.ll20G>i; p.D374Y) mutation, reported 
in the UK population. 9 The majority of pathogenic 
LDLR variants are single nucleotide changes 
leading to significant alterations in the amino acid 
sequence of the mature protein 7 or creation of a 
truncated peptide. FH is also caused by variants 
that affect correct splicing 7 and by changes in the 
transcription-factor-binding elements located in 
the promoter. 10 

While LDLR/ APOB /PCSK9 mutations cause a 
dominant pattern of inheritance 7 an autosomal 
recessive hypercholesterolaemia (ARH) has also 
been observed. The locus for ARH was mapped to a 
chromosome 1 gene 7 the LDLRAP1 , in which both 
homozygous and compound heterozygous muta- 
tions can be found. 11 Most of the ARH-causing 
mutations are due to premature stop codons. 

In 20 08 7 the UK National Institute for Health 
and Clinical Excellence (NICE) guidelines were 
published 7 and included a recommendation that all 
FH patients be offered a DNA test to confirm their 
diagnosis and 7 so that mutation confirmation 
could be used 7 to cascade-test their first-degree rela- 
tives. Newly identified patients can then be offered 
statin treatment. 12 In most laboratories 7 FH muta- 
tion screening includes use of commercially avail- 
able kits designed to test for the most common 
mutations 7 such as Elucigene FH20 (Gen-Probe Life 
Sciences 7 UK) and LIPOchip (Progenica Biopharma 7 
Spain) , and for large gene rearrangements (dele- 
tions or duplications) , which account for 4%-5% 
of all FH mutations. 13 However 7 in the UK 7 due to 
the highly heterogeneous nature of the population 
this approach is not fully effective 7 and many 
patient samples require screening the LDLR pro- 
moter and coding regions 7 splice sites and splice 
branch points for causative mutations 7 and in the 
diagnostic laboratory this is currently performed 
using Sanger sequencing. Because of the time and 
labour of these methods 7 there has been interest in 
next-generation sequencing (NGS) technology for 
the diagnosis of genetic disorders. However ; 
whether NGS is ready for clinical use has been 
questioned. 14 Main limitations of the technology 
include the requirement for complex data analysis 7 
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significant computing infrastructure with respect to data ana- 
lysis and storage, and legal and ethical issues associated with 
incidental findings from acquiring whole exome data. In the 
research laboratory a four-phased approach is used to screen FH 
patients to identify the causative mutation, using the commer- 
cially available Amplification Refractory Mutation System kit, 
which tests for 20 of the most common UK mutations, fol- 
lowed by High-Resolution Melting (HRM) to detect changes 
within the coding region and splice sites of FH genes, followed 
by Multiplex Ligation-dependent Probe Amplification (MLPA), 
for the detection of large LDLR gene rearrangements, and 
finally Sanger sequencing. 15-17 Using standard molecular diag- 
nostic techniques, an FH-causing mutation can be detected in 
20%-30% of PFH patients, and in 60%-80% of DFH patients. 18 
The UK10K is a large-scale deep-sequencing project, based on 
collaboration between multiple investigators at the Wellcome 
Trust Sanger Institute, and clinical experts in different genetic 
diseases. A total of 125 FH samples with no LDLR/APOB/ 
PCSK9 mutation are currently in the exome sequencing pipe- 
line, as a part of the Rare Diseases group of the UK10K. The 
aim of the project is to provide collaborators with high-quality 
exome data, which will be used for the discovery of novel 
disease genes. This paper reports the sequencing results of the 
first 48 FH exomes, and discusses sensitivity problems of the 
current FH mutation-screening methods, as well as demonstrat- 
ing advantages and limitations of the whole exome sequencing 
approach. 

MATERIALS AND METHODS 
Patients' selection 

Forty-eight unrelated FH patients were selected from the 
Simon Broome FH register. 19 All individuals were Caucasian 
and attended a lipid clinic in London, Oxford or Manchester. 
Patients were diagnosed using the UK Simon Broome criteria as 
DFH on the basis of the presence or history of TX. The entire 
promoter and coding regions, including splice sites, of the 
LDLR gene were screened by the HRM method, as previously 
described, 16 on the Rotor- Gene (6000) real-time rotary analyser. 
Patients were screened for presence of the APOB mutation, p. 
(R3527Q), using a restriction enzyme digest, 20 and the entire 
coding region of the PCSK9 was examined by HRM. 17 
Fragments with a heterozygous melting curve were analysed 
further by direct sequencing. Screening for large rearrangements 
within the LDLR gene was done using the MLPA 21 SALSA 
P062 LDLR kit from MRC-Holland (Amsterdam). One hundred 
and ninety five non-FH Caucasian samples, sequenced in paral- 
lel with the FH cohort, as a part of the UK10K rare disease arm 
project (http://www.uklOk.org/studies/rarediseases.html), were 
used as controls. None of these subjects had disorders known 
to affect plasma lipid levels. 

Whole exome sequencing 

Genomic DNA (1-3 jig), extracted from blood, 22 was sheared to 
100-400 bp using a Covaris E210 or LE220 (Covaris, Woburn, 
Massachusetts, USA). Sheared DNA was subjected to Illumina 
paired-end DNA library preparation and enriched for target 
sequences (Agilent Technologies, Santa Clara, CA, USA; 
Human All Exon 50 Mb - ELID S02972011) according to the 
manufacturer's recommendations (Agilent Technologies, Santa 
Clara, CA, USA; SureSelectXT Automated Target Enrichment 
for Illumina Paired-End Multiplexed Sequencing). Enriched 
libraries were sequenced (eight samples over two lines) using 
the HiSeq 2000 platform (Illumina) as paired-end 75 base reads 
according to the manufacturer's protocol. 



Bioinformatic analysis 

To improve raw alignment BAMs for single nucleotide poly- 
morphism (SNP) calling, we realigned around known (1000 
Genomes pilot) indels, and recalibrated base quality scores using 
GATK. BAQ tags were added using samtools calmd. BAMs were 
merged to sample level and duplicates marked using Picard. 
Variants (SNPs and indels) were called on each sample individu- 
ally with both samtools mpileup (0.1.17) and GATK 
UnifedGenotyper (1.3-21), restricted to exon bait regions plus or 
minus a 100 bp window. Various quality filters were applied to 
each of the callsets separately. Calls were then merged, giving 
preference to GATK information when possible. Calls were 
annotated with 1000 Genomes allele frequencies, dbSNP132 
rsIDs and earliest appearance in dbSNP. Functional annotation 
was added using Ensembl Variant Effect Predictor v2.2 against 
Ensembl 64, and included coding consequence predictions, 
Sorting Tolerant From Intolerant (SIFT), PolyPhen and Condel 
annotations, and Genomic Evolutionary Rate Profiling (GERP) 
and Grantham Matrix scores. Variants previously reported by 
the 1000 Genomes project with minor allele frequency higher 
than 0.01 were filtered out. Variants that passed the initial filter- 
ing were compared against 195 non-FH control whole exomes, 
processed using the same pipeline, and only FH unique changes 
were further assessed. Pathogenicity of any private (ie, specific 
to the FH cohort) variant was examined using previous knowl- 
edge and by bioinformatic mutation-prediction tools, which 
included: PolyPhen2, SIFT^ Condel, Mutation Taster, PhyloP and 
Gratham Score algorithm. Sanger sequencing was used to 
confirm presence of any identified predicted pathogenic variant. 

Copy number variants analysis 

The copy number variants (CNV) analysis uses a read depth 
strategy designed to overcome biases associated with sequence 
capture and high-throughput sequencing. This set of tools is 
implemented in the package ExomeDepth (freely available at 
the Comprehensive R Archive Network. 23 

RESULTS 

Overall, the mean read depth for the whole exome sequence 
was 72x, with 78.9% of the exome covered at least 20x; and 
55.8% of the targeted sequence was covered 50x or more. 

LDLR analysis 

The average read depth of LDLR exons varied from 136x for 
exon 12, to 4x for exon 18 (figure 1). Using a 16x coverage 
threshold, which would give a 99% probability of observing a 
rare allele at least three times, all except exons 1 and 18 showed 
adequate coverage. Exons 3 and 4 contain the largest number of 
reported FH-causing mutations, 24 and both these exons were 
well covered (mean depth 92 and 57, respectively). 

As shown in table 1, in 14 out of the 48 samples, a 
FH-unique variant in the LDLR was called, in 11 of these the 
variant has been previously reported to be FH-causing. 25 

All these variants were confirmed to be correct by Sanger 
sequencing of duplicate DNA samples (not shown). The variants 
included five different missense mutations and five nonsense 
mutations. The two novel LDLR variants included: c.695-6_ 
698del and c.l776_1778del (p.(G592del)). The c.695-6_698del is 
predicted to cause a frameshift and a premature stop codon by 
altering LDLR splicing (see online supplementary figure SI). The 
deletion of Glycine at residue 592 is predicted to disrupt pack- 
aging of the LDL-R propeller blades in the epidermal growth 
factor (EGF) domain, which could affect displacement of the 
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Figure 1 The exonic coverage of FH 
genes: APOB, LDLR, LDLRAP1 and 
PCSK9 in a standard box plot (the 
minimum, lower quartile, median, 
upper quartile and maximum for each 
gene exon). The horizontal dashed line 
indicates the 16x coverage, when the 
probability of observing a rare allele at 
least 3 times is 99%. The additional 
Y-axis describes GC content 
(0.4=40%, 0.5=50%, etc) for a given 
exon, shown as rhombuses. 
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ligand from the ligand-binding region (see online supplementary 
figure S2). Of these 14 mutations, all should have been detected 
by our standard screening protocol described in the Materials and 
methods section, except for c.695-6_698del 7 where the change 
was located in the primer sequence used for PCR. 

CNV calling identified one deletion of exons 11 and 12 (c. 
1587-?_1845 + ?del) 7 and two duplications of exons 3-8 (191—? 
_1186+?dup) 7 and exons 13-15 (c.l846-?_2311 + ?dup) 7 as 
shown in figure 2. All CNVs were confirmed by MLPA (see 
online supplementary figure S3). 



APOB analysis 

The mean read depth of APOB exons was 93. Exons 26 and 29 
were covered on average 135x 7 whereas exon 1 was covered 
only once (figure 1). Two individuals were found to carry the 
FH-causing APOB mutation in exon 26 7 the p.(R3527Q). There 
were no nonsense or frameshift mutations observed in the gene 
sequence. One novel non-synonymous variant was found in 
exon 26 of APOB, the p.(A3426V) 7 which was unique to the 
FH cohort. The variant was predicted as Tolerated 7 , 'Benign 7 
and 'Polymorphism 7 by SIFIJ PolyPhen and Mutation Taster 7 
respectively. Four other FH-unique non-synonymous variants 
were observed outside of the ligand-binding domain (see online 
supplementary table SI). The functional impact of these var- 
iants 7 as predicted by PolyPhen/SIFT/Mutation Taster 7 was not 



consistent, and whether or not these are FH-causing is unclear. 
There were no CNVs found within the APOB gene. 

PCSK9 analysis 

The mean read depth of the PCSK9 exons was 23. Of these 7 
only 58% of the gene coding sequence had the mean coverage 
higher than 16 7 whereas exons 1 7 6 7 10 7 11 and 12 were covered 
4x on average (figure 1). Exon 7 7 where the common UK 
FH-causing mutation (c.H20G>'T^ p.(D374Y)) occurs 7 was 
covered 36x. Two novel non- synonymous variants were called 
by the exome sequencing 7 c.1027G>C and c.1028A>C 7 both 
present in the same sample. However 7 despite the high read 
depth (51x and 50x) 7 and the high number of read count for the 
novel alleles (19 and 26) (see online supplementary figure S4) 
the Sanger sequencing did not confirm the variants. There were 
no CNVs observed in PCSK9- 

LDLRAP1 analysis 

The average read depth of LDLRAP1 was 36 with all ; except exons 
1 and 9 covered above the 16x threshold (figure 1). The LDLRAP1 
variant analysis was performed using a homozygosity-based strat- 
egy and the presence of compound heterozygote variants was 
also assessed. There were no homozygous or compound heterozy- 
gous functional changes within the gene in any of the individuals. 
One patient was found to be heterozygous for a known Sicilian/ 
Sardinian ARH mutation, the c.432_433insA 7 p.(A145KfsX26) 7 



Table 1 Summary of pathogenic single nucleotide changes and small deletions/insertions in the FH genes 



Gene Samples (n) 


Nucleotide change 


Functional effect 


Depth 


Quality 


Comments 


LDLR 1 


c.326G>A 


p.(C109Y) 


43 


506 


known FH mutation 




c.1690A>C 


p.(N564H) 


36 


343 


known FH mutation 




c.1823C>T 


p.(P608L) 


82 


1214 


known FH mutation 




c.2054C>T 


p.(P685L) 


20 


135 


known FH mutation 




c.2479G>A 


p.(V827l) 


65 


749 


known FH mutation 




c.682G>T 


p.(E228X) 


13 


155 


known FH mutation 




C.1048OT 


p.(R350X) 


60 


816 


known FH mutation 




c.1150C>T 


p.(Q384X) 


20 


275 


known FH mutation 




c.1685G>A 


p.(W562X) 


41 


701 


known FH mutation 




c.2140+1G>A 


Splicing 


22 


258 


known FH mutation 




c.695-6 698del 


Splicing 


36 


1543 


novel 




c.1776_1778del 


p.(G592del) 


148 


2634 


novel 


APOB 1 


c.10277G>A 


p.(A3426V) 


192 


2785 


novel 




c.10580C>T 


p.(R3527Q) 


161 


2144 


known FH mutation 


PCSK9 1 


c.1027G>C 


p.(D343H) 


51 


44 


false positive 




c.1028A>C 


p.(D343A) 


50 


201 


false positive 


LDLRAP1 1 


c.432_433insA 


p.(A145KfsX26) 


90 


2186 


heterozygous 



'Depth' refers to the coverage depth; 'Quality' values are Phred-like quality scores generated by SAMtools. 
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Figure 2 Copy Number Variants (CNVs) in /.£//./? gene. A: 
heterozygous duplication of exons 3-8. B: heterozygous deletion of 
exons 11 and 12. C: heterozygous duplication of exons 13-15. All 
identified by ExomeDepth in the exome sequencing data. The 
crosses show the ratio of observed/expected number of reads for 
the test sample. The grey shaded region shows the estimated 99% 
CI for this observed ratio in the absence of CNV call. The presence 
of contiguous exons with read count ratio located outside of the CI 
is indicative of a heterozygous deletion or duplication in a sample. 
Exons 1 and 18 were excluded from the analysis (not shown on the 
graph) as they did not reach the threshold of 100 for the total 
number of reads. All CNVs were confirmed by MLPA experiment 
(see online supplementary figure S3). 



which is a frameshift mutation resulting in a truncated peptide 
formation. 11 26 Further analysis of this sample showed no other 
pathogenic variants in known FH genes 7 which could contribute 
to the phenotype. CNV calling did not detect large rearrange- 
ments in the LDLRAP1. 

DISCUSSION 

Current FH-screening methods 

The exome sequencing results exposed sensitivity problems 
with the current FH mutation-screening methods used in our 
research laboratory Overall 7 the standard variant-detection 
process already in place (HRM 7 MLPA and Sanger sequencing) 
did not detect 17 LDLR mutations (including 3 CNVs) and 2 
APOB mutations. Although the HRM has proved to be efficient 
at detecting FH variants/ 6 its sensitivity decreases in some 
gene regions 7 depending on the nucleotide composition of the 
fragment. Re-examining previous results for the samples with a 
LDLR or APOB variant called by the NGS 7 we observed that 
most of the variants showed a melting curve shift during the 
HRM assay but Sanger sequencing of the identified gene region 
did not detect any heterozygous changes in the sequence 
despite being repeated several times (i.e. only the predicted 
wild-type sequence was obtained). After the exome sequencing 
the Sanger sequencing was repeated on a duplicate DNA 
sample 7 and the predicted mutations were confirmed to be 
present, validating the exome sequencing and variant calling. 
Although Sanger sequencing is considered to be the gold stand- 
ard mutation-detection method, a combination of PCR arte- 
facts and the human error aspect in the protocol appears to be 
the main reason for the false negative calling in the original 
screening. 

Novel LDLR variants 

Two novel variants in LDLR were identified, a deletion of 10 bp 
on boundary of intron 4 and exon 5 7 which is predicted to 
cause a frameshift resulting in a premature stop codon by alter- 
ing LDLR splicing, and a three bp deletion which deletes 
Glycine at residue 592, which is predicted to disrupt packaging 
of the LDL-R propeller blades in the EGF domain. Neither of 
these variants are found in dbSNP, the 1000 Genomes or the 
NHLBI Exome Sequencing Project, and are highly likely to be 
FH-causing 7 although further work is required to confirm this. 

Novel APOB variants 

APOB codes for one of the largest human proteins, which is the 
major component of the LDL-C responsible for binding to 
the LDL receptor. 27 The actual binding site for the receptor, the 
B-site (residues 3386-3396), has been mapped to a region 
encoded by exon 26 of the APOB, which is the longest coding 
exon known (7572 bps). 28 In addition, the C-terminus encoded 
by exon 29 of the gene was proposed to function as a modula- 
tor of the receptor binding. 28 Therefore, our variant analysis 
strategy prioritised novel variants located in exons 26 and 29 of 
the gene, as these are more likely to cause the FH phenotype. 
In this study, there was only one novel variant identified in the 
exon 26 of APOB, the c.10277G>A (p.(A3426V)), which was 
not observed previously by the dbSNP, the 1000 Genomes or 
the NHLBI Exome Sequencing Project. The variant was not 
present in the 195 non-FH exomes from the UK10K project, 
which were processed using the same pipeline, increasing the 
likelihood that it is in fact disease-causing. The novel p. 
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(A3426V) variant is located near to the LDL receptor-binding 
site (B-site) 7 and close to the known FH mutation p.(R3527Q) 7 
and although it does not alter the charge at the site 7 it may 
produce a conformational change affecting the LDL-R/ApoB 
interaction. This requires further experiments since the current 
in silico prediction tools are not able to assess protein-protein 
interactions. We will also examine whether or not the variant 
cosegregates with the disease. Four other novel APOB variants 
were identified in this group of patients in the N-terminal part 
of the protein. Although these variants are less likely to influ- 
ence LDL clearance from the blood 7 since the N-terminal region 
of the protein is not involved in interacting with the LDL-R 7 
some of the variants are predicted as damaging by Polyphen or 
SIFT The aim of this study was to assess the clinical utility of 
exome sequencing as a sensitive mutation detection tool, rather 
than finding novel FH mutations. Future work includes the 
assessment of novel identified variants, which will involve 
family cosegregation and functional assays. 

Promoter region analyses 

Most of the sequencing data generated for Mendelian disorders 
are focused on the exome, which constitutes around 1% of the 
whole human genome. Prediction tools for the analysis of non- 
synonymous changes are well established and widely used to 
estimate the deleteriousness of amino acid changes. However, 
since the majority of human variations are located in the non- 
coding regions, 29 concern about the bias towards variants in 
the protein-coding sequence was highlighted. 30 Proving the 
pathogenic effect of promoter variants requires use of func- 
tional assays. To date, there are 13 LDLR promoter variants pre- 
dicted to be causal (in revision 10 ). Disappointingly, but not 
surprisingly, given they were not targeted, the exome sequen- 
cing data generated by the SureSelect Human All Exon 
(Agilent) assay, had negligible coverage of the gene promoter 
regions, which can lead to false negative conclusions. Further 
updates of the human exome capture assay should include 
coverage of the LDLR promoter sequences, which can cause 
autosomal dominant disease by altering gene regulation. 

Exome sequencing 

The SureSelect Human All Exons capture assay is a standard 
product, which proved to be efficient at detecting mutations 
within the LDLR and the APOB genes. In this sample, 78.9% of 
exome bases were covered at least 20 times, which is in line 
with the product description -80%. For both LDLR and APOB, 
the majority of the coding sequence was covered more than the 
16x threshold to achieve an estimated 99% chance of seeing a 
real variant (present in a heterozygous state) of at least 3 
times, and overall 19 mutations, were found by high- 
throughput DNA sequencing, which had been missed by con- 
ventional methods in our research laboratory. This indicates 
increased sensitivity for NGS, which can be due both to the 
method used and to the reduced human intervention, and the 
highly automated protocol. However, as with many PCR-based 
methods, exome sequencing has some limitations when it 
comes to amplification of highly repetitive regions or sequences 
rich in GC content. A highly significant negative correlation 
between the G/C content and the exome depth was observed 
in the FH genes (p=4.9xl0 -14 ), as shown in the online supple- 
mentary figure S5. Specifically, only 58% of the PCSK9 gene 
was covered more than 16x, producing unreliable results for 
variant calling in a significant proportion of the gene's coding 
region. As a result, the two novel non-synonymous PCSK9 var- 
iants called by the exome sequencing were not confirmed by 



the capillary sequencing, suggesting a high rate of false positive 
calls when the coverage is poor. If a read depth threshold of 
30x was considered to be required for complete certainty of 
variant calling, at which the sensitivity to detect heterozygous 
variants was shown to be 100%, 31 exons that would be insuffi- 
ciently covered would also include exons 1, 14 and 18 of LDLR, 
exons 2 and 5 of PCSK9, and exons 2, 3 and 7 of LDLRAP1 . 
Thus, although the quality of the produced data is good, valid- 
ation of called variants in poorly covered regions is still neces- 
sary. Applying more stringent filters to the raw data increases 
the specificity of the calling. However, it may also lead to false 
negative results, since not all of the exome 7 s regions are equally 
covered. Newer versions of the SureSelect assay show markedly 
improved coverage of exons that were previously poorly covered 
(unpublished data), so we can expect the sensitivity of exome 
sequencing to improve. 

The Agilent SureSelect assay was efficient in capturing the 
exon-intron junctions, covering on average 80-1 00bps of the 
intronic regions. This was an advantage over our current screen- 
ing protocol, and enabled us to detect a novel variant, the 
c.695-6_698del, which is partially positioned on the annealing 
site for the sequencing primer routinely used in our lab. 

The methodology behind the ExomeDepth package 23 proved 
to be robust, and enabled us to use the exome data, which are 
composed of short sequence reads for exonic regions, to detect 
large gene rearrangements, which are known in the LDLR to be 
usually due to intronic Alu sequence misparing. 32 33 The 
method was shown to allow identification of heterozygous 
CNVs within the LDLR gene, which were missed by the cur- 
rently used MLPA. However, in order to maximise the sensitiv- 
ity and to minimise the noise created by technical variability 
between samples, CNV analysis by Exome Depth requires 
quality data of well-matched exomes (>6 samples), that is, 
sequenced under the exact same conditions. 

The greater time efficiency of the exome sequencing is a signifi- 
cant advantage over the current screening methods. Although 
each called variant currently needs to be individually confirmed 
by Sanger sequencing before a mutation report can be prepared, 
analysing a number of patients in parallel in a short period of 
time is likely to be an efficient way forward for screening of het- 
erogeneous FH patients. More importantly, limited use of manual 
checks and human intervention reduce the issues of possible 
human error. The cost efficiency of NGS is also increasing. The 
development of novel approaches of gene-targeted sequencing, 
using Illumina MiSeq platform, reduces not only the costs of 
sequencing itself but also the time spent on data analysis and 
computer storage requirements. The possibility of designing 
custom amplicons for each disease, recently offered by Illumina 
TruSeq Custom Amplicon or Agilent HaloPlex products, will also 
improve the capture of promoters and other regulatory regions, 
which could be omitted in whole exome eequencing. 
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